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ABSTRACT: 

A circuit for pattern matching on the frequency axis comprises a first transformation section 
having an adaptive filter generally identical to an adaptive finite impulse response filter with its 
delay elements having been replaced by all-pass filters, for performing frequency transformation 
on input pattern data from said memory means, a second transformation section having an 
adaptive filter generally identical to an adaptive finite impulse response filter with its delay 
elements having been replaced by all-pass filters, for performing frequency transformation on 
reference pattern data from the memory, and a control section for calculating a distance between 
the output data from the first transformation section and the second transformation section, 
controlling parameters of filters of at least one of the first and second transformation sections so 
as to minimize this distance, and generating a pattern matching residual output on the 
frequency axis based on the minimum distance. 
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© Pattern matching system. 



CD® A circuit for pattern matching on the frequency axis comprises a first transformation section having an 
^adaptive filter generally identical to an adaptive finite impulse response filter with its delay elements having been 
©replaced by all-pass filters, for performing frequency transformation on input pattern data from said memory 
means, a second transformation section having an adaptive filter generally identical to an adaptive finite impulse 
response filter with its delay elements having been replaced by all-pass filters, for performing frequency 
transformation on reference pattern data from the memory, and a control section for calculating a distance 
between the output data from the first transformation section and the second transformation section, controlling 
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parameters of filters of at least one of the first and second transformation sections so as to minimize this 
distance, and generating a pattern matching residual output on the frequency axis based on the minimum 
distance. 
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PATTERN MATCHING SYSTEM 



BACKGROUND OF THE INVENTION 

5 This invention relates to a pattern matching system used in speech recognition equipment, more 
particularly to two-axis pattern matching on the frequency and time axes using linear predictive coefficients 
or the like. 

This invention also relates to a frequency transformation circuit, more particularly to a circuit for 
transforming the frequency structure of a signal which can be incorporated in the pattern matching system. 

70 When speaker-independent speech recognition apparatus performs pattern matching, it must first 
normalize individual differences which are present on both the frequency axis and the time axis. An 
example of a prior-art system of pattern matching on these two axes is disclosed in the paper "Speaker- 
Independent Spoken Word Recognition Based on Time-Frequency-Intensity Warping of Spectra", 
Nakagawa et al. The Transactions of the Inst of Sectronics and Communication Engineers of Japan./ 81/2 

rs Vol. J64-D No. 2. This method performs dynamic-programming matching (DPM) on the frequency axis and 
time axis of discrete output values from bandpass filters. 

A problem in the above pattern matching system is that since it uses features obtained from the output 
of a bank of bandpass filters, it has difficulty in handling the details of frequency patterns accurately. 
Another method uses the fast Fourier transform (FFT) instead of a filter bank, but this method has the 

20 problem of confusing voice pitch effects with vocai-tract effects, and can provide only inexact results. A 
difficulty associated with speech recognition is that speech signal contains rapid changes along frequency 
axis. 

25 SUMMARY OF THE INVENTION 



An object of the present invention is to solve these problems and provide a pattern matching system for 
a speech recognition equipement that can perform pattern matching on two axes accurately by a simple 
30 process. 

Another object of the invention is to provide a frequency transformation circuit with the capability to 
perform a variety of frequency structure transformations. 

To solve the problems stated above, in speech recognition system that performs pattern matching using 
linear predictive analysis coefficients, the pattern matching system according to one aspect of this invention 
35 comprises 

(a) a memory means for storing the linear predictive analysis coefficients of an input pattern and a 
reference pattern, and providing output of the data at a time specified by a command signal, 

(b) a first pattern matching section comprising 

(bl) a first transformation means having an adaptive filter generally identical to an adaptive finite 
40 impulse response (FIR) filter with its delay elements having been replaced by all-pass filters, for performing 
frequency transformation on the input pattern data from the above memory means, 

(b2) a second transformation means having an adaptive filter generally identical to an adaptive FIR 
filter with its delay elements having been replaced by all-pass filters, for performing frequency transforma- 
tion on the reference pattern data from the above memory means, and 
45 (b3) a control means for calculating a distance between the output data from the first transforma- 

tion means and the second transformation means, controlling the parameters of filters of at least one of the 
first and second transformation means so as to minimize this distance, and generating a pattern matching 
residual output on the frequency axis based on the minimum distance, and 

(c) a second pattern matching section for performing pattern matching on the time axis by dynamic 
so programming according to the residual from the control means and generating a residual output on the time 

and frequency axes, and for giving a command signal to the memory means. 

In a pattern matching system described above, the memory means transfers to the first pattern 
matching section and the second pattern matching section linear predictive analysis coefficients (this term 
being defined as coefficients obtained through linear predictive analysis, in this specification) such as linear 
predictive coefficients of the input pattern data and reference pattern data at a time indicated by a 



3 



0 290 190 



command signal from the second pattern matching section. Frequency transformation is performed there 
through adaptive control of the delays (in the delay elements) of at least one of the first transformation 
means and second transformation means by the control means (for example, by a distance calculation 
section and a parameter decision section described later). 

s For example, if only the delays in the first transformation means are controlled, the control means 
calculates a distance between the output data of the first and second transformation means, controls the 
delays in the first transformation means so as to minimize that distance; and transfers the minimum- 
distance pattern matching residual on the frequency axis to the second pattern matching section. Based on 
the residual on the frequency axis, the second pattern matching section executes pattern matching on the 

w time axis by means of dynamic programming (DP) and generates final output of the residual from pattern 
matching on both the frequency axis and the time axis. 

In this way it is possible to solve the problems of the prior art stated earlier by simultaneously 
performing pattern matching on the frequency axis using linear predictive analysis coefficients, for example, 
as features and performing pattern matching on the time axis by DP matching. 

rs According to another aspect of the invention, there is provided a circuit for pattern matching on the 
frequency according to this invention comprises: (a) a first circuit having a first memory means for storing a 
first constant series corresponding to a frequency transformation parameter value, which performs fre- 
quency transformation by a multiply-add operation on linear linear predictive analysis coefficients of an 
input pattern and the first constant series, and calculates the resulting distance from the linear predictive 

20 analysis coefficients of a standard pattern on which frequency transformation has already been performed; 
(b) a second circuit having a second memory means for storing a second constant series corresponding to 
the above parameter value, which calculates changes in the frequency-transformed linear predictive 
analysis coefficients induced by this parameter by performing a multiply-add operation on the linear 
predictive analysis coefficients of the input pattern and the second constant series; and (c) a third circuit for 

2S updating the parameter above so as to minimize the distance based on the outputs of the first circuit and 
the second circuit, and generating as output a pattern matching residual on the frequency axis based on the 
minimum distance. 

In a pattern matching circuit of the above structure . the first circuit (for example, a frequency 
transformation and distance calculation circuit described later) performs a multiply-add operation on the 

30 linear predictive analysis coefficients of the input pattern and a first constant series obtained from the first 
memory means corresponding to a parameter vafue set by the third circuit thereby calculates the 
frequency-transformed linear predictive analysis coefficients, and also calculates the distance between 
these coefficients and the linear predictive analysis coefficients of a reference pattern, which have already 
been frequency-transformed. At the same time, the second circuit performs a multiply-add operation on the 

35 linear predictive analysis coefficients of the input pattern and a second constant series obtained from the 
second memory means corresponding to the parameter value set by the third circuit, thereby calculating 
the variations in the linear predictive coefficients induced by frequency transformation according to this 
parameter. The third circuit (for example: a residual detection circuit a updating circuit decision circuit, and 
switch) updates the parameters of the first and second memory means on the basis of the distance output 

40 from the first circuit and the variations output from the second circuit so as to minimize the distance. As a 
result the first and second memory means provide first and second constant- series corresponding to the 
updated parameter value. When the updated calculation results attain a minimum value, the residual based 
on the minimum distance is output as the residual on the frequency axis. In this way pattern matching on 
the frequency axis is performed by storing first and second constant series corresponding to the parameter 

45 values in the first and second memory means, calculating the frequency-transformed linear predictive 
analysis coefficients as features by a multiply-add operation performed with these series, calculating the 
distance and variations, and adaptively controlling the parameter. The circuit can therefore be small and 
simple in structure, but can perform pattern matching on the frequency axis accurately. 

According to another aspect of the invention, there is provided a frequency transformation circuit 

so characterized by having a plurality of all-pass filters with a designated transfer characteristic, means for 
multiplying the output signals from these all-pass filters by tap coefficients, and an adder for adding the 
signals obtained from these means. 

With this structure, the parameters defining the transfer characteristics of the all-pass filters are selected 
according to a desired purpose, and the tap coefficients are set in accordance with desired sample values 

55 depending on the desired purpose. In consequence, a transformation is effected between the frequency 
structure of an input signal and the frequency structure of a time series obtained as an output signal from a 
unit impulse input. 

The frequency transformation circuit of the above structure has the ability to perform a variety of 
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frequency structure transformations. 

According to another aspect of the invention, there is provided a frequency transformation circuit which 
comprises storage means for storing parameters defining the designated transfer characteristics of a 
plurality of all-pass circuits, a parameter determining the size of a frequency transformation to be performed 
5 on an input signal, and constant values of a unit impulse series; and multiply-add means for multiplying and 
adding these constant values with the input signal. 

With the above structure, the constant values necessary for frequency transformation of the input signal 
are read from the storage means wherein they are stored. The multiply-add means performs a multiply-add 
operation on the input signal and corresponding constant values, as a result of which a frequency- 
w transformed output signal is obtained. 

The frequency transformation circuit of this configuration is simple in structure, and small-sized. 



BRIEF DESCRIPTION OF THE DRAWINGS 

15 

Fig. 1 is a block diagram showing a pattern matching system of an embodiment of this invention. 

Fig. 2 is a diagram of the internal structure of the SPM section of the embodiment in Fig. 1 . 

Fig. 3A and Fig. 3B show the internal structure of the frequency transformation sections in Fig. 2. 
20 Rg. 3C is a diagram combining Fig. 3A and Fig. 3B to show the frequency transformation sections. 

Fig. 4 is a block diagram showing an example of the parameter decision section 33 of Fig. 2. 

Rg. 5A is a shematic diagram of the internal structure of a filter for finding the variation g Kn) . 

Rg. 5B is a schematic diagram of a modification of the filter of Rg. 5A. 

Rg. 6 explains the pattern matching process performed in the DPM section. 
25 Fig. 7 is a schematic diagram showing another example of the structure of the frequency transforma- 

tion sections. 

Rg. 8 is a block diagram explaining the principle of operation of another embodiment of this 
invention. 

Rg. 9 is a schematic diagram of this embodiment 
30 Rg. 10 is a block diagram showing a modification of the frequency transformation circuit. 

Rg. 11 A and Rg. 11 B are characteristic curves showing experimental data the frequency transforma- 
tion circuit of Rg. 10. 

Rg. 12 is a block diagram for explaining the principle of operation of another embodiment of the 
frequency transformation circuit 
35 Rg. 13 is a block diagram showing the embodiment of Rg. 12. 



DETAILED DESCRIPTION OF THE EMBODIMENTS 

40 

Rg. 1 is a block diagram showing an embodiment of this invention. In this figure the numeral 1 denotes 
a memory for storing linear predictive coefficients obtained as features from an input speech signal (input 
pattern), 2 is a memory for temporarily storing linear predictive coefficients obtained as features from a 
reference speech signal (standard pattern), 3 is a spectrum pattern matching (SPM) section that performs 

45 pattern matching on the frequency axis based on the signals output from the memories 1 and 2, and 4 is a 
time pattern matching (DPM) section that performs pattern matching on the time axis, based on the output 
signals from the SPM section 3. by means of DP matching, and supplies the memories 1 and 2 with data 
m, k specifying time, so that the memories 1 and 2 produce the input data of the specific time. 
The operation is explained next 

so The memory 1 receives a set of linear predictive coefficients a^, = {ao (m> . ai (m) ap (m) } (m = 1,2, 

.... M; time) of the input speech signal on which pattern matching is to be performed, and sends to the SPM 
section 3 the linear predictive coefficients a (m) for the time specified by the signal m frornthe DPM_section 
4. Similarly, the memory 2 receives a set of linear predictive coefficients a (k) - { a o tk) , a i <k> , .... a p w } (k 
= 0, 1 , .... K; time) of the reference speech signal and sends to the SPM section 3 the linear predictive 

55 coefficients a (W for the time specified by the signal k from the DPM section 4. The SPM section 3 performs 
pattern matching on the frequency axis with regard to the linear predictive coefficients a (m) of the input 
speech signal and the linear predictive coefficients a (k) of the reference speech signal. The residual de that 
remains is sent to the DPM section 4. Based on the residual de from the SPM section 3, the DPM section 4 

5 
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performs pattern matching on the time axis by DP matching, and generates final output of the residual D of 
pattern matching on both the frequency axis and the time axis. 

The internal structure of the SPM section 3 is shown in Fig.. 2. The SPM section 3 comprises two 
frequency transformation sections 31 and 32. a distance calculation section 34, and a parameter decision 
section 33. The frequency transformation section 31 for the input signal receives the linear predictive 
coefficients a< m> and generates output of a set of linear predictive coefficients a^j resulting from frequency 
transformation according to a variable parameter a received from the parameter decision section 33. The 
frequency transformation section 31 in Fig. 2 receives the all-pass filter j>aramete,r or from the parameter 

calculation section 33. continuously controls the frequency, and outputs a(m) = {io(m), Ii(m) a q (m)}. 

The parameters a determine the amount of frequency variation. The frequency transformation section 32 for 
the reference speech signal receives the reference speech linear predictive coefficients a = { a o, a i , a 
p }, performs a frequency transformation by the same process as in the frequency transformation section 1 , 
and outputs the reference linear predictive coefficients af = { afo, a"i, aq}. The frequency 
transformation section performs only a standard frequency transformation. The parameter a 0 used for the 
transformation is fixed. In this embodiment, the standard transformation is a transformation to a psychologi- 
cal scale (met frequencies). 

Rg. 3A shows the structure of the frequency transformation section 31 of the input speech signal. The 
frequency transformation section 31 comprises all-pass filters 31 z, multipliers 31c. and adders 31a. The 
frequency transformation section 31 is identical to an FIR filter except that delay elements in an FIR filter 
have been replaced by the all-pass filters 31 z. The frequency transformation section receives linear 
predictive coefficients as tap coefficients. The impulse response of the all-pass filters 31 z is the set of linear 
predictive coefficients a< m) = {ao (rn) . Si (m) , ap {m) } after frequency transformation. The frequency 
transformation section 32 of the reference speech signal is similar. 

The general form of the transfer function of an all-pass filter is: 



/J 

a(z) = ±_TT (z' 1 -°V /(1 -*i z ~ l) (1) 



Rg. 3B shows an example of the structure of an all-pass filter. The order of the filter shown is N, with 
each section comprising adders 301 and 303, multipliers 304 and 302 for the parameter a. and a delay 
element 305. In this embodiment the all-pass filters in each stage have the same order. If a first order all- 
pass filter is used, the relation between a^) and S( m , is: 




Bj^'Kz" 1 -*)/(! -rtz'MHdz (2) 



The frequency fl in the a m is transferred in a to: 
co = tan" 1 {(1 -a2)sin0/2 + (1 + c^cosfl} (3) 

Rg. 3C is a diagram combining Rg. 3A and Rg. 38 to show the frequency transformation section 31. 
The principle of operation of the frequency transformation section will be explained with reference to 
Rg. 3C. 

The numeral 41 denotes an input terminal for the parameter a determining the amount of frequency 
transformation 42 is an input terminal for a digital unit impulse series {1. 0. .... 0} t 43-0 to 43-P are input 
terminals for the linear predictive coefficients of the input signal, 44-0 to 44-P are multipliers, 31z-1 to 31z-P 
are all-pass filters, 31a-l to 31a-P are adders, and 47 is an output terminal. 

The all-pass filters 31z-1 to 31z-P comprise an adder 301. a delay element 305 a multiplier 302 a 
multiplier 304 and an adder 303. 

The frequency transformation circuit with this configuration operates as follows. Rrst a constant value o 
is received through the input terminal 41 and a unit impulse series (1. 0, .... 0) of P + 1 samples is received 
through the input terminal 42. At point 49-0, accordingly, the input is a unit impulse series, while at points 
49-1 to 49-P the input is a series of constant values comprising the constant value a and a unit impulse 
series. The multipliers 31c-0 to 31c-P and the adders 31a-l to 31a-P perform a multiply-add operation on 
the constant values at the points 49-0 to 49-P and the input signals received through the terminals 43-0 to 
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10 



43-P to obtain the sum of the products of the constant values at the points 49-0 to 49-P and the input 
signals received at the input terminals 43-0 to 43-P and this sum becomes the output at the output terminal 
47. The output is a time series representing the frequency transformed signal. 

After frequency transformation, the distance calculation section 34 in Fig. 2 calculates the distance 
(difference) d between the two sets of the linear predictive coefficients a( m) and a {k) by the following 



formula: 



d = »(■) " »(k) 

ia Q a Q , a x aj , . . . , a q - a q } 

= {d Qf dj, . . . , d q } (4) 



The distance d obtained as the result is sent to the parameter decision section 33. 

The parameter decision section 33 continuously updates the parameter a using the non-linear- 
parameter least-squares method to minimize the distance d. When the circuit shown in Rg. 3B is used as 
the all-pass filter, then if a (n) is the all-pass filter parameter a after the n-th update and d (n ) = {do< n >. di (n) , 
d q(n) } is the distance d after the n-th update, then the parameter a (n+1) after the next update is given by: 

W(n + 1) = <*(n) -Xo(n) 



25 



= z 

i*0 



l;(n)'g.(n)/£ 



{gj(n)r 



(5) 



30 

The parameter decision section 33 receives the distance 

d(n) = {d 0 (n). di(n) d,(n), .... d q (n)} 

and outputs the parameter a(n + 1) after the update. An example of the parameter determing section 33 is 
shown in Rg. 4. As illustrated, the correction amount 5a(n) is calculated at a correction amount calculator 
35 33i, from the output gi(n) of a gradient filter 33h and di(n). The parameter a(n + 1) after the update is 
determined at an adder 33k, from the present parameter o(n) stored in a register 33j and the correction 
amount 5a(n) from the correction amount calculator 33i. 

The gradient filter 33h determines the variation gi(n) of ai(m) with respect to a(n). 
An example of a filter for determining variations g^j of at with respect to a(n) is shown in Rg. 5A. This 
40 filter comprises all-pass filters (A(z)) 33z (substituting delay elements in an FIR filter), multipliers 33c, 
adders 33a, and a differentiating filter (A'(z)) 33d. The variations g i( „) are obtained as a time series from the 
impulse response of the filters shown. In Rg. 5A the differentiating filter 33d is located on the output side, 
but it could also be located on the unit impulse input side as shown in Rg. 5B. That is, the differentiating 
filter 33d is placed in a prestage before the FIR filter 33f configured similarly to the frequency transforma- 
45 tion circuit. The output of this differentiating filter 33d is a constant value corresponding to the input unit 
impulse and the input parameter a, and the variation g can be obtained by a simple multiply-add calculation 
using a ROM circuit More specifically, the variations gi< n > are given by a time series according to the 
impulse response of the filter: 

» _ . -Li i-1 f „ (■) . 

g i(n) £> a j J 

((z" 1 -* (n) >/(l -* (n)Z - l » j - 1 
5s (z" 2 " D/(l - <* (n) z -1 ) 2 dz (6) 



The initial value of a is set equal to the fixed parameter oo of the frequency transformation section 32 
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15 



20 



25 



30 



for the reference speech signal. 

When the difference between the parameter a (n) and the updated parameter a (n+1) is such that 
|{a t „*i)-a(n)}/a( n ,| < e (where e is a threshold value), the distance calculation section 34 generates the 

output de according to the following formula: 



This de is the residual left after adaptive frequency control to minimize the distance of the input linear 
predictive coefficients a. That is, d e is the residual resulting from pattern matching of the input signal and 
reference signal on the frequency axis. 

Let de(m, k) be the residual resulting from pattern matching on the frequency axis performed by the 
SPM section 3 on the linear predictive coefficients a^, of the input speech signal and the linear predictive 
coefficients a & of the reference speech signal. 

Fig. 6 shows how the DPM section 4 performs pattern matching on the time axis on the basis of this 
residual de(m, k). The values of de(m, k) are located at the grid points in Fig. 6. The DPM section 4 
determines the path from the point (1, 1) to the point (M, K) that minimizes the sum of the de(m, k): 

\ de(m, k). 

This sum is determined by a recursive formula, the partial sum D(m r k) being given by: 
D(ra, k) = ■ infDCn, k-l) + de(m, k) 

D(ra-1, k-l) + 2do(n. k) (8) 
D(ra-1. k) * de(ni, k) 



as where D(0, 0) = 0, D(m, 0) = « (m * 0), D(0, k) = «> (k * 0), de{M + 1. K + 1) = 0. The final pattern 
matching residual Oe is: 

De = D(M + 1, K+1V(M + K) (9) 
The DPM section 4 generates this residual De as its output. 

The pattern matching method of the DPM section 4 described above does not impose any restrictions 
40 on the path, but it is also possible to determine D(m, k) as: 

D(m, k) = rain fD(m-l, k-2) + 3de(m, k) 

D(m-1, k-l) + 2de(m. k) (10) 

D(ra-2, k-l) + 3de(a 9 k) 



45 



where D(0. 0) = 0, D(m, k) = » when m £ 0 and k * 0, D(m, k) = <» when m * 0 and k $ 0, and de(M + 1, 
so K + 1) = 0, and obtain the pattern matching residual 
De = D(M + 1, K + 1)/(M + K). 

The embodiment described above uses a method of controlling the parameter a of the SPM section 3 
in which all the initial values are a = ao (constant), but it is possible to hasten the convergence of the 
parameter a by using as the initial value the converged value of the previous stage. 
55 For simplicity, in the frequency transformation sections 31 and 32 of the SPM, the all-pass filters A(z) 
(31 z) in all stages had the same parameter a, but it is also possible to use an all-pass filter (31 z) having a 
differing configuration (parameter value or order) in each stage. Fig. 7 shows an example of such a 
configuration. 
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Similarly, it should be clear that different conditions can be used as the DPM path restriction conditions. 
In addition, although linear predictive coefficients were used as features in the foregoing description, 
linear predictive cepstral coefficients, linear predictive melcepstral coefficients, or other coefficients obtained 
by the linear predictive method could be used instead. The term "linear predictive analysis coefficient(s) w 
5 as used in the appended claims should therefore be construed to cover such alternatives. 

Furthermore, in the preceding embodiment the parameter a of the frequency transformation section 31 
of the input speech signal (input pattern), which is the delay (frequency transformation amount), was 
adaptively controlled, but it is also possible to adaptively control the delay of the frequency transformation 
section 32 of the reference pattern, or to control the delays of both frequency transformation sections 
io adaptively, although that would require an increased amount of computation. 

In the above configurations, pattern matching functions are provided for both the frequency axis and the 
time axis and pattern matching is performed simultaneously on these two axes using linear predictive 
analysis coefficients, so the matching process is accurate and simple. As a pattern matching method for 
use in speaker-independent speech recognition apparatus, this method can be expected to improve the 
;s recognition rate. 

Fig. 8 and Fig. 9 show another embodiment of this invention. Fig. 8 is a block diagram explaining the 
principle of operation of this embodiment Rg. 9 is a schematic diagram of a pattern matching circuit 
illustrating an embodiment of this invention. 

First the principle of operation of this embodiment will be explained with reference to Fig. 8. In Fig. 8 

20 reference numeral 51 denotes an input terminal for the linear predictive coefficients of an input signal (input 
pattern), 52 is a frequency transformation and distance calculation circuit, 53 is an a updating variation 
detection circuit, 54 is a residual detection circuit, 55 is an updating circuit, 56 is a decision circuit (DEC), 
57 is a switch (SW). 58 is an output terminal, 59 is an input terminal for input of linear predictive coefficients 
of a reference pattern (standard pattern), and 60 is a frequency transformation circuit. The time series a = 

25 {ao. a t , .... a p } of the linear predictive coefficients of the input signal through the input terminal 51 are input 
as tap coefficients to and frequency-transformed by the frequency transformation circuit which may be 
similar to that shown in Fig. 3A through Rg. 3C, which acts on a unit impulse series. The frequency 
transformation circuit is similar to an FIR filter, but its delay elements have been replaced by all-pass filters. 
Thus, the frequency transformation circuit of the frequency transformation and distance calculation circuit 52 

30 performs frequency transformation corresponding to a parameter a that defines the all-pass filters of an FIR 
filter to produce an output impulse series a._ 

The time series a = { a o. a i a p } of linear predictive coefficients of the reference signal is 

subjected only to a standard frequency transformation in the frequency transformation circuit obtaining a 
= { T c , T i af q }. In this case the parameter defining the all-pass filters has the fixed value a 0 . 

35 Next, the distance calculation circuit in the frequency transformation and distance calculation circuit 52 
generates the distance d between the frequency-transformed linear predictive coefficients a and If , where 

d = a - a* = {do. di d q } = {ao - ao. ai - ^ i aq - af q }. 

At this time, an impulse response g expressing the variation in a when the parameter a is moved is 
obtained from the a updating variation detection circuit 53, which comprises an FIR filter similar to the 

40 frequency transformation circuit. 

The parameter a is continuously updated so as to minimize the distance d. If a(n) is the parameter a 

after the n-th update and d(n) = {do< n) , di ln> d q (n) } is the distance d after the n-th update, then the 

parameter a(n + 1) after the next update is given by: 



This calculation is performed by the a updating circuit 55. The initial value, of a is the vaiue ao used in the 
frequency transformation of the linear predictive coefficients a of the reference signal. 

When the decision circuit 56 obtains from the parameter a(n) and the updated parameter a(n + 1) the 
result |{a(n + l) -a(n)}/a(n)l < e (where e is a threshold value), the residual detection circuit 54 outputs the 
result of the following calculation through the switch 57: 



45 



<x(n+l) = *(n) 




50 
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This d e is the residual left when frequency control is adaptively performed to minimize the distance 
between the linear predictive coefficients a of the input signal and the reference linear predictive coefficients 
a ; that is, it is the residual left after pattern matching on the frequency axis. The frequency transformation 
circuit in the frequency transformation and distance calculation circuit 52 can be the one identical to that 
shown in Fig. 3A to Fig. 3C. 

An explanation will now be given of the operation of the circuit in Fig. 9. which embodies the principles 
of operation just described for pattern matching on the frequency axis. Component elements in Fig. 9 that 
are identical to elements in Fig. 8 are indicated by the same reference numerals. The memory (MEM) 60a 
holds precalculated values of the linear predictive coefficients af obtained by frequency transformation of 
the linear predictive coefficients a of the reference signai and provides output of -aT in order to calculate a 
distance. The frequency transformation and distance calculation circuit 52 comprises a parameter memory 
(MEM) 52a. a multiplier (MPY) 52b. an adder (ADD) 52c ( and a register (REG) 52d. The a updating variation 
detection circuit 53 comprises a parameter memory (MEM) 53a, a multiplier (MPY) 53b, an adder (ADD) 
53c. and a register (REG) 53d. 

The residual detection circuit 54 comprises a multiplier (MPY) 54a. an adder (ADD) 54b, and a register 
(REG) 54c. The a updating circuit 55 comprises multipliers (MPY) 55a and 55d, adders (ADD) 55b, 55e, and 
55h, registers (REG) 55c, 55f, and 55i, and a divider (DIV) 55g. 

The operation of this embodiment is described next 

First the time series a of linear predictive coefficients of the input signal in input from the input terminal 
55 to the frequency transformation and distance calculation circuit 52. This circuit 52 uses the multiplier 
52b, the adder 52c. and the register 52d to multiply and add the input linear predictive coefficients with the 
series of constant values output from the memory 52a (which consist of the parameter a and an impulse 
series as described earlier). If the frequency-transformed reference data -a stored in the memory 60a are 
placed in the register 52d as initial values, the result of the multiply-add calculation described above is the 
distance d. 

At the same time, the linear predictive coefficients are input by the a updating variation detection circuit 
53. This circuit 53 uses the multiplier 53b, the adder 53c, and the register 53d to multiply and add the 
series of constant values output from the memory 53a and the input linear predictive coefficient series, 
yielding as output the variations g. The initial value of the register 53d in this operation is "0." 

Next, the residual detection circuit 54 uses the multiplier 54a to square the distance time series d 
received from the a updating variation circuit 53, and uses the adder 54b and register 54c (which is 
initialized to "O") to add the results and obtain the residual d e . The true matching residual is not obtained, 
however, unless the values of a converge. The control for this purpose will be described later. 

Next the a updating circuit 55 uses the multiplier 55a, the adder 55b, and the register 55c to multiply 
and add the distance time series d with the output g of the a updating variation detection circuit 53 to obtain 
Z d«g. At the same time, the multiplier 55d squares the output of the updating variation detection circuit 53. 
and the adder 55e and register 55f obtain the sum Eg 2 . Next the divider 55g performs the division E 
d«g/Eg 2 , and the adder 55h adds the previous value of a in the register 55i (at the start time this is the 
initial value) to obtain a new value of a. 

The decision circuit 52c decides whether the value of a obtained in this way has converged or not If it 
decides that the value has not converged, the updated value of a is used to change the values in the 
memory 52a and the memory 53d. If the range of is limited, the memory 52a and the memory 53d can be 
implemented as ROM circuits containing, in each memory, a calculated value for each value of a that can 
be selected. 

The above-described operation is repeated until the value of a converges. When the decision circuit 56 
decides that the value has converged, it closes the output control switch 57, and the output d e of the 
residual detection circuit 54 calculated with the value of a determined at this time becomes the output at 
the output terminal 58 representing the pattern matching residual on the frequency axis. 

In the above configuration, a first and a second series of constant values corresponding to parameter 
values determining frequency transformation quantities are held in memory means, multiply-add operations 
are performed on these series of constant values and linear predictive coefficients representing, features to 
calculate frequency-transformed linear predictive coefficients, distances, and variations, and parameter 
matching on the frequency axis is performed with adaptive control of the parameters, so the circuit can be 
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compact and simple in configuration, and precise pattern matching on the frequency axis can be performed. 
It is thus possible to provide inexpensive speech recognition equipment with an improved rate of 
recognition. 

A modification of the frequency transformation circuit incorporated in the above-described pattern 
5 matching system is shown in Fig. 10. 

in Fig. 10 the numerals 101-0, 101-1 101-P, 102-1, 102-2 102-N, and 103 denote terminals, 104 

is an oscillator, 105-1. 105-2 105-P are Nth-order all-pass-filters (APFs), 106-0, 106-1, 106-2 106-P 

are amplifiers, and 107 is an adder. The Nth-order APFs 105-1, 105-2, .... 105-P ail have the same transfer 
function: 

N r'-a k _ Z-'-a, Z^~a 2 Z"-q N 

k-i l-a„Z" 'l-a.Z' 1 l-a 2 Z _1 l-a N Z-' 



where the a k are constants such that \a k \ < 1 (k = 1 N) and A is the constant 1 or -1. Values in the 

range from -1 to 1 , corresponding to the desired frequency transformation characteristic, are input at the 
terminals 102-1, .... 102-N, and the value 1 or -1 is input at the terminal 103. The values from the terminals 

102-1 102-N are furnished to all the Nth-order APFs 105-1, 105-2 105-P to set the values at,o 2 

a N of the transfer function of the APFs 105-1, 105-2. .... 105-P. The input value at the terminal 103 is also 

furnished to all the Nth-order APFs 105-1, 105-2 105-P to set the value A is the transfer function of the 

Nth-order APFs 105-1. 105-2 105-P. The terminals 101-0. 101-1. 101-2 101-P receive the first 

sample data, the second sample data, the third sample data. .... the (P + 1)-th sample data of the digital 

signal to be frequency-transformed. The data from the terminals 101-0, 101-1, 101-2 101-P are supplied 

to the amplifiers 106-0, 106-1, 106-2 106-P to set the gain of the amplifiers 106-0. 106-1. 106-2 106- 

P. Next an output signal from the oscillator 104, which generates a digital unit impulse signal, is applied to 
the amplifier 106-0 and the Nth-order APF 105-1. P - 1 Nth-order APFs, namely the Nth-order APFs 105-2, 

105- 3 105-P. are connected in series to the output of the Nth-order APF 105-1. The outputs from the 

Nth-order APFs also become the inputs to the amplifiers 106-1, 106-2, .... 106-P. Accordingly, the amplifier 

106- 0 amplifies the output signal from the oscillator 104 with a gain set by the data from the terminal 101-0. 

The amplifiers 106-1, 106-2 106-P amplify the output signals from the Nth-order APFs 105-1. 105-2 

105-P with gains set by the data from the terminals 101-1. 101-2, .... 101-P. The output signals from the 
amplifiers 106-0. 106-1, 106-2, .... 106-P are received by the adder 107, which adds all the output signals 

from the amplifiers 106-0, 106-1, 106-2 106-P. The output from the adder 107 is a signal with a 

transformed frequency structure, derived from a digital signal having as its first sample, second sample, 
third sample, (P + 1)-th sample the data from the terminals 101-0, 101-1, 101-2, .... 101-P by a frequency 
structure transformation with a characteristic defined by values of at, a 2 , ... t a N and A in the formula for the 
transfer function of the APFs 105-1, 105-2. .... 105-P set by the values at the terminals 102-1, 102-2, .... 
102-N, 103. 

Fig. 11A and Fig. 11 B are characteristic curves showing experimental data obtained with the circuit of 
the above configuration. In both diagrams frequency (Hz) is shown on the horizontal axis and power (dB) on 
the vertical axis. First-order APFs with the transfer function A(Z~ ! -a)/(1 -aZ" 1 ) were employed as the Nth- 
order APFs. The input signal consisted of 129 samples of a 1001Hz sine wave sampled at a rate of 8kHz. 
As the parameters A and a that determined the frequency structure transformation characteristic, a value of 
1 was used for A and a value of -0.5 for a. As can be seen by examining the transformation from Fig. 11 A, 
which shows the power spectrum of the untransformed signal, to Fig. 11B, which shows the power 
spectrum of the transformed signal, it is possible to transform the frequency characteristic without changing 
the power. 

In this embodiment, once the values from the terminals 102-1. 102-2. .... 102-N, 103 are set they remain 
fixed during circuit operation, but it would be useful for a plurality of adaptive purposes to vary these values 
over time. Also, although the transfer functions of all the Nth-order APFs 105-1, 105-2. .... 105-P were 
identical, it would be useful for a plurality of adaptive purposes to vary the transfer functions of the Nth- 
order APFs 105-1. 105-2 105-P with respect to each other. 

As explained above, in the above configuration, there are many parameters for determining the 
frequency structure transformation characteristic, so a variety of frequency structure transformations can be 
effected. If provided with speech signal input, the circuit of the above configuration can also be used in 
speech signal processing apparatus. If provided with the impulse response of a filter as input, this circuit 
can further be used in a filter characteristic transformer, and if provided with speech recognition features as 



11 



0 290 190 



input it can be used in a speech feature transformer. 

Another modification of the frequency transformation circuit is shown in Fig. 12 and Rg. 13. 

It is seen from Rg. 3C, that the values at the points 49-0, 49-1, 49-2 49-P are constants. It follows 

that the all-pass circuits 31z-1, 31z-2 31 z-P in Rg. 3C can be configured as in Rg. 12 using ROMs 221- 

5 1 221 -P wherein the constant values are stored. In other words, the time-series output of the frequency- 
transformed signal can be obtained from the output terminal 223 in Rg. 12 by means of a multiply-add 
operation performed on constant values from the ROMs 221-1, .... 221-P and the input signals received 

from the terminals 222-0, 222-1 222-P. 

It follows that the circuit of this embodiment can be configured as shown in Rg. 13. In Rg. 13 the 

w numeral 201 denotes a ROM, 202 is a RAM, 203 is a parallel multiplier, 204 is an adder, 205 is a register 

(REG), and 47 is an output terminal. The ROM 201 corresponds to the ROMs 21-1 21 -P in Rg. 12. 

Locations Xq-p in the ROM 201 contain a sequence of values corresponding to the constant values at the 
points 49-0, 49-1 , 49-2, .... 49-P in Rg. 3C when the unit impulse series value is 1 . Locations Yo-p in the 
ROM 201 contain a sequence of values corresponding to the constant values at the points 49-0. 49-1. 40-2. 

15 .... 49-P in Rg. 3C when the next unit impulse series value is 0. Locations Zo-p in the ROm 201 contain a 
sequence of values corresponding to the constant values at the points 49-0, 49-1, 49-2, 49-P in Rg. 3C 
when the P + 1-th sample of the unit impulse series value is 0. The RAM 202 contains the input signal 
series from the terminals 43-0, 43-1, .... 43-P. 

Next the operation of the above-described circuit will be explained with reference to Rg. 13. 

20 Rrst the multiplier 203 multiplies Xo in the ROM 201 with H in the RAM 202. The result is added by the 
adder 204 to the initial 0 output from the register 205 and the sum is placed in the register 205. Next the 
multiplier 203 multiplies Xi in the ROM 201 with I in the RAM 202. The result is added by the adder 204 to 
the previous result in the register 205 and the sum is placed in the register 205. This process is repeated 
through Xq-p in the ROM 201 and L in the RAM 202, and a time series output No. 0 is obtained from the 

25 output terminal 47. Next the same process is performed on the Yo-p in the ROM 201 and H, I, L in the 
RAM 202, obtaining the next time series output if this process is repeated through Zo-p in the ROM 201 and 
H, I. .... L in the RAM 202, the entire frequency-transformed output is obtained from the output terminal 47. 

The above-described frequency transformation circuit can be made small in scale and simple in circuit 
configuration. 

30 

Claims 

1. A circuit for pattern matching on the frequency axis, comprising: 
35 (a) a first transformation means having an adaptive filter generally identical to an adaptive finite 

impulse response filter with its delay elements having been replaced by all-pass filters, for performing 
frequency transformation on input pattern data from said memory means; 

(b) a second transformation means having an adaptive filter generally identical to an adaptive finite 
impulse response filter with its delay elements having been replaced by ail-pass filters, for performing 

40 frequency transformation on reference pattern data from said memory means; and 

(c) a control means for calculating a distance between the output data from the first transformation 
means and the second transformation means, controlling parameters of filters of at least one of the first and 
second transformation means so as to minimize this distance, and generating a pattern matching residual 
output on the frequency axis based on the minimum distance. 

45 2. A pattern matching system on the frequency and time axes, comprising: 

(a) a memory means for storing linear predictive analysis coefficients of an input pattern and a 
reference pattern, and providing output of the data at a time specified by a command signal; 

(b) a first pattern matching section for performing pattern matching on the frequency axis, having 
frequency transformation means, and outputting a pattern matching residual on the frequency axis: and 

so (c) a second pattern matching section for performing pattern matching on the time axis by dynamic 

programming according to the residual from the first pattern maching section, thereby generating a residual 
output, and for giving the command signal to the memory means. 
3. A frequency transformation circuit comprising: 
a plurality of all-pass filters with designated transfer functions; 
55 means for multiplying the output signal from each all-pass filter by a tap coefficient; and 
an adder for adding the signals obtained from said means; 

wherein by selection of the parameters defining the transfer characteristics of said all-pass filters and 
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by setting of said tap coefficients in accordance with desired sample values which are desired to be 
frequency-transformed, the frequency structure of an input signal is transformed to the frequency structure 
of a time series obtained as an output signal from a unit impulse input. 

4. A frequency transformation circuit according to Claim 3, wherein said parameters are set so that the 
5 transfer functions of said all-pass filters are all the same. 

5. A frequency transformation circuit according to Claim 3, wherein said parameters are set so that the 
transfer functions of said all-pass filters differ from each other. 

6. A frequency transformation circuit according to Claim 3. wherein said parameters are varied over 
time. 

w 7. A frequency transformation circuit comprising; 

storage means for storing parameters defining designated transfer characteristics of a plurality of all- 
pass circuits, a parameter determining the size of a frequency transformation to be performed on an input 
signal, and constant values of a unit impulse series; and 

means for performing a multiply-add operation on said constant values and said input signal. 
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